18 research outputs found

    Annotating honorifics denoting social ranking of referents

    Get PDF
    This paper proposes an annotating scheme that encodes honorifics (respectful words). Honorifics are used extensively in Japanese, reflecting the social relationship (e.g. social ranks and age) of the referents. This referential information is vital for resolving zero pronouns and improving machine translation outputs. Annotating honorifics is a complex task that involves identifying a predicate with honorifics, assigning ranks to referents of the predicate, calibrating the ranks, and connecting referents with their predicates

    Annotating honorifics denoting social ranking of referents

    Get PDF
    This paper proposes an annotating scheme that encodes honorifics (respectful words). Honorifics are used extensively in Japanese, reflecting the social relationship (e.g. social ranks and age) of the referents. This referential information is vital for resolving zero pronouns and improving machine translation outputs. Annotating honorifics is a complex task that involves identifying a predicate with honorifics, assigning ranks to referents of the predicate, calibrating the ranks, and connecting referents with their predicates

    Zero Pronoun Resolution in a Machine Translation System by Using Japanese to English Verbal Semantic Attributes

    No full text
    A method of anaphoral resolution of zero pronouns in Japanese language texts using the verbal semantic attributes is suggested. This method focuses attention on the semantic attributes of verbs and examines the context from the relationship between the semantic attributes of verbs governing zero pronouns and the semantic attributes of verbs governing their referents. The semantic attributes of verbs are created using 2 different viewpoints: dynamic characteristics of verbs and the relationship of verbs to cases. By using this method, it is shown that, in the case of translating newspaper articles, the major portion (93%) of anaphoral resolution of zero pronouns necessary for machine translation can be achieved by using only linguistic knowledge. Factors to be given special attention when incorporating this method into a machine translation system are examined, together with suggested conditions for the detection of zero pronouns and methods for their conversion. This study considers four factors that are important when implementing this method in a Japanese to English machine translation system: the difference in conception between Japanese and English expressions, the difference in case frame patterns between Japanese and English, restrictions by voice and restriction by translation structure. Implementation of the proposed method with due consideration of these points leads to a viable method for anaphoral resolution of zero pronouns in a practical machine translation system

    A system of verbal semantic attributes focused on the syntactic correspondence between Japanese and English

    No full text
    Relation-- __ Mental State -- Natnre ACTION Become Cause Enable _ Physicd__ Aclion Mental Start~l{nd---[ Start End Existence Attribute Possession Relative Relation Rclatkm of Cause and Effect . Percepthal State Emofive State Thinking State -- Physical Transfer -- Possessive 'l'rans: Attribute Transtar - Bodily Transfer Resnit Bodily Action Use Connective Action- |h-oducfion __ Extinction- lstrucfion Men Transfer Perceptual Action Erectire Action Thinking Action 4. Result of Application for the Semantic Descriptions of Verbal Patterns We evaluated the coverage of the verbal semantic attributes shown in chapter 3 by examining the verbal semantic attributes for each Japanese to English pair (about 15,000 pairs) in tile Japanese to English transfer pattern dictionaries 3

    An implemented description of Japanese : the Lexeed dictionary and the Hinoki treebank

    No full text
    In this paper we describe the current state of a new Japanese lexical resource: the Hinoki treebank. The treebank is built from dictionary definition sentences, and uses an HPSG based Japanese grammar to encode both syntactic and semantic information. It is combined with an ontology based on the definition sentences to give a detailed sense level description of the most familiar 28,000 words of Japanese.Published versio
    corecore